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1 Introduction 

More than a century has passed since Takagi [75] published his simple example of 
a continuous but nowhere differentiable function, yet Takagi's function - as it is 
now commonly referred to despite repeated rediscovery by mathematicians in the 
West - continues to inspire, fascinate and puzzle researchers as never before. For 
this reason, and also because we have noticed that many aspects of the Takagi 
function continue to be rediscovered with alarming frequency, we feel the time has 
come for a comprehensive review of the literature. Our goal is not only to give an 
overview of the history and known characteristics of the function, but also to discuss 
some of the fascinating applications it has found - some quite recently! - in such 
diverse areas of mathematics as number theory, combinatorics, and analysis. We 
also include a section on generalizations and variations of the Takagi function. In 
view of the overwhelming amount of literature, however, we have chosen to limit 
ourselves to functions based on the "tent map". In particular, this paper shall not 
make more than a passing mention of the Weierstrass function and is not intended 
as a general overview of continuous nowhere-differentiable functions. We thank Prof. 
Paul Humke for encouraging us to write this survey, and for issuing periodic cheerful 
reminders. 



1.1 Early history 

Takagi's function is indeed simple: in modern notation, it is defined by 

T(x) = Y, ¥ <P(2 n x)> (1-1) 

n=0 
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Figure 1: Graph of the Takagi function 



where 4>(x) = dist(x, Z), the distance from x to the nearest integer. The graph of 
T is shown in Figure [U Takagi himself expressed his function differently, and this 
is perhaps one reason (in combination with Japan's isolation at the beginning of 
the twentieth century) why it was largely overlooked in the West. Unlike for the 
more famous Weierstrass function, it is easy to show that T has at no point a finite 
derivative; we include the short proof due to Billingsley [17] in Section 12.11 However, 
it does possess an infinite derivative at many points, and for this reason Knopp 
[3D] , in his 1918 review of the rapidly growing body of "strange" functions, did not 
consider it truely nowhere differentiable. Knopp outlined his own geometric method 
for producing functions which have no derivative, finite or infinite, at any point. The 
example most similar to the Takagi function is 

oo 

/(*) = jV^ar), 

n=0 

where < a < 1, b is an integer and ab > 4. Knopp's general construction also 
includes the Weierstrass function and Faber's example [26] as special cases. 

A variant of Takagi's function, using the base 10, was discovered in 1930 by Van 
der Waerden, who credits Dr. A. Heyting for sending in the proof of its nondiffer- 
entiability, in response to a problem in the publication Wiskundige Opgaven of the 
Dutch Mathematical Society. Three years later, Hildebrandt [33] showed that one 
can more simply use the base 2, thus rediscovering Takagi's original example. An 
editorial note affixed to Hildebrandt's paper solicited answers to the "interesting and 
probably not too difficult" question at which set of points T(x) has an infinite deriva- 
tive. Surprisingly, it would take 77 years for this natural question to be answered 
correctly, perhaps because the answer is not all that easy to guess; see Section 12.11 
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below. 

In 1939 the Takagi function was rediscovered by Tambs-Lyche [76], who was also 
inspired by Van der Waerden's paper and motivated by a desire to give "an example 
easy to understand for beginning students of analysis". Tambs-Lyche defined it by 
a formula different from both (II. lft and Takagi's original definition (see Section Hj), 
and stated without proof its equivalence to (11.11) . Tambs-Lyche was also the first to 
publish a graph of Takagi's function, hand-drawn but remarkably accurate. 

Apparently unaware of Hildebrandt's and Tambs-Lyche's notes, de Rham [62] re- 
discovered Takagi's function once more in 1957. His main contribution, however, was 
to identify T as a member of a more general class of functions which are solutions to 
a certain family of functional equations; in today's language, de Rham observed that 
the Takagi function is self-affine. His paper soon inspired Kahane [31] to determine 
the points of global and local extremum of T, and to modify the definition (II. ip in 
order to create functions with a prescribed modulus of continuity. 

By the 1960s, the Takagi-van der Waerden function was sufficiently well known 
that it could be used as the key element in solutions to other problems, both in 
classical real analysis and in number theory. Lipinski [53J used it in his elegant 
characterization of zero sets of continuous nowhere differentiable functions, after 
Schubert [66] had started the investigation. And Trollope [77] observed that the 
Takagi function was the missing piece of the puzzle in the binary digital sum problem; 
his proof was simplified and extended further by Delange [25]. These applications 
are described in detail in Sections 18.11 and 18. 3[ respectively. 

1.2 The Takagi function comes home 

Interest in the Takagi-van der Waerden function spiked after 1980, with two more 
or less independent streams of publications. In the West, Billingsley [17] drew new 
attention to the function with his short note in the American Mathematical Monthly, 
providing perhaps the most lucid proof of the function's nowhere differentiability. His 
argument was modified by Cater [22] to show that T does not even possess a finite 
one-sided derivative anywhere, and Shidfar and Sabetfakhri [68] proved that T is 
Lipschitz of every order a < 1. A sharper result was obtained by Mauldin and 
Williams [59], who investigated a much larger class of functions defined by infinite 
series and showed that the Takagi function is "convex Lipschitz" of order Mog(l / h) . 
Anderson and Pitt [12J slightly improved on this by showing that 

T(x + h)- T(x) = 0(hlog(l/\h\), as /i ^ 0, 

and this estimate is the best possible. As a result, the Hausdorff dimension of the 
graph of T is one. 
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Meanwhile, the Takagi function had become popular in the country of its birth, 
due to the influential paper by Hata and Yamaguti [31]. Besides finally restor- 
ing credit to its original inventor, these authors did much to elevate Takagi's func- 
tion beyond the realm of recreational mathematics, by pointing out its connection 
with chaotic dynamical systems and proving a beautiful relationship between T and 
Lebesgue's singular function (also called Salem's function or the Riesz-Nagy func- 
tion). Hata and Yamaguti also replaced the factor l/2 n in fll.ip by an arbitrary 
constant c n , calling their new family of functions the Takagi class. Kono [H] char- 
acterized completely the differentiability properties of members of the Takagi class - 
there are three qualitatively different cases - and proved several other results about 
these functions, most of them of a probabilistic nature. Gamkrelidze [29] later applied 
Kono's methods to obtain a Central Limit Theorem-type result for the small-scale os- 
cillations of T. In the same year as Hata and Yamaguti's paper, Baba [13] calculated 
the maxima of the general Takagi-van der Waerden function (with arbitrary base 
r > 2), after Martynov [57] had rediscovered Kahane's result about the maximum of 
T. Tsujii [78] constructed a Takagi-like function of two variables, and Yamaguchi et 
al. [81] viewed the graph of T as the invariant repeller of a dynamical system. 

1.3 Recent work 

In the last two decades, the literature on the Takagi function and related topics 
seems to have grown exponentially. Papers from this period can be loosely classified 
into three categories: papers about the Takagi function itself, papers dealing primar- 
ily with applications, and papers discussing various generalizations and variations. 
Some papers fit more than one category. It is impossible to describe each individual 
contribution in this introduction. We limit ourselves here to succinct groupings of 
papers by topic, referring to later sections for the details. 

1. Papers about the Takagi function itself. These can be further divided as follows. 
Kairies et al. [36] and Kairies [35J characterize T by its functional equations. Other 
papers, such as Brown and Kozlowski [19], Abbott et al. p] and Watanabe [80J, 
focus on various local continuity properties. The infinite derivatives of T are dealt 
with in Kriippel [I5j H7] and Allaart and Kawamura [8 J . There is also a sustained 
effort ongoing to understand the complicated level set structure of T; see Buczolich 
[2Tj . Maddock [55], Lagarias and Maddock [501 [51], Allaart [5j |6], and de Amo et 
al. [9j. A richly illustrated expository article by Martynov [58] gives step-by-step 
explanations of the main characteristics of T, aimed at undergraduate students. 

2. Papers concerned with applications. A number of authors have extended 
Trollope's result about binary digital sums in various directions. The papers most 
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closely related to the Takagi function are Okada et al. [61], Kobayashi [42] and 
Kriippel [16]. Recently Hazy and Pales [32] and Boros [18] found Takagi's function 
to be the extremal case in the theory of approximately midconvex functions. Their 
work was elaborated on by Tabor and Tabor [731 an d Mako and Pales [56J, 
and this last paper includes many further references. Allaart [1] reduces the crucial 
inequality in the above papers to a simple inequality for binary digital sums, thus 
linking the two applications. Takagi's function also arises naturally as the limit in 
certain counting problems in graph theory; see Frankl et al. [28], Knuth [JT] and Guu 
[30] . It even has been used in an equivalent statement of the Riemann hypothesis; 
see Balasubramanian et al. [H]. 

3. Papers about generalizations and variations. There are literally hundreds of 
papers about generalizations of the Takagi function. Many replace the tent map 
with a more general bounded "base" function; some also introduce random phase 
shifts. We will not discuss such functions here, but limit ourselves to generalizations 
based on the tent map. The most direct extension, a subcollection of the Takagi 
class, is the family of functions f a (x) = Yl^Lo Q;Tl 0(2 n ^) where < a < 1. They were 
studied for a > 1/2 by Ledrappier [52], who computed their Hausdorff dimension, 
and for a < 1/2 by Tabor and Tabor [731 El] an d Allaart [1]. Other extensions allow 
the "tents" at each stage of the construction to be flipped up or down individually. 
This way one obtains for instance the Gray Takagi function of Kobayashi [42] or the 
function T 3 of Kawamura [38] . Anderson and Pitt [12] , Abbott et al. pQ and Allaart 
[3] investigate general properties of this larger class of functions. Sekiguchi and 
Shiota [67], generalizing the work of Hata and Yamaguti, obtained another family of 
continuous functions, which were examined more closely by Allaart and Kawamura 
[7J. A version of the Takagi function with random signs is studied in Allaart [2], 
and Kawamura |39J considers the composition of T with a singular function. Finally, 
Sumi [72] introduces a complex version of the Takagi function in connection with 
random dynamics in the complex plane. 

1.4 Organization of this paper 

This survey is organized as follows. Section [2] focuses on analytic aspects of the 
Takagi function. We give Billingsley's proof of the nowhere-differentiability of T and 
characterize the set of points where T has an infinite derivative. In Section 12.21 we 
treat the Holder continuity of T and explain the work of Abbott et al. [1] regarding 
slow points. This is followed by a more detailed examination of the modulus of 
continuity of T. 

Section [3] deals with the graph of T. We first discuss the global and local extrema 
of T. Then we point out the partial self-similarity of the graph and illustrate how to 
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use this to prove a specific theorem, namely that the graph of T has cx-finite linear 
measure. 

In Section H] we give a number of different expressions for T(x), show how these 
can be derived from one another, and explain how they have been used to prove 
various aspects of the Takagi function. 

Section [5]gives functional equations for T, presents T as the unique bounded solu- 
tion of a system of infinitely many difference equations, and discusses the connection 
of T with Lebesgue's singular function. 

Section |6] is devoted to the level sets of T. This area of research is currently very 
active: Nearly all the results in this section were found in the last five years or so. 
We outline a proof, based on the partial self-similarity ideas of Section [3j of the fact 
that almost all level sets of T are finite, and give an overview of the other known 
facts about the level sets. The section ends with a list of open problems. 

Section [7] gives an overview of some of the generalizations of the Takagi function 
and other related functions. This includes the general Takagi- van der Waerden func- 
tions, the Takagi class, and the Zygmund spaces A^, and A^ 1; all of which are in 
some sense fairly direct extensions of the Takagi function. This section too concludes 
with a list of open problems. 

Section |8] deals with applications, and is divided into four parts. Subsection 18.11 
presents Trollope's formula for the sum of binary digits of the first N positive inte- 
gers and discusses several related results. In Subsection 18.21 we treat applications of 
the Takagi function to the problem of finding the minimum shadow size in uniform 
hypergraphs and to the edge-discrete isoperimetric problem on the rz-cube. Subsec- 
tion 18.31 deals with applications in classical real analysis and consists of two parts: 
one on the use of T in Lipinski's characterization of zero sets of continuous nowhere 
differentiable functions, and one on the role of T and its generalizations in approx- 
imate convexity problems. Finally, Subsection 18.41 explains the connection between 
Takagi's function and the Riemann hypothesis. 

We have not attempted to give equal coverage to all the players in this arena. 
The things we have chosen to emphasize reflect our interests and expertise, not the 
importance or quality of the cited works. 

While we were preparing this article, we learned that Jeffrey Lagarias [49J was 
working on a survey paper of his own. The two surveys evolved for the most part 
independently, and while there is inevitably a considerable degree of overlap, the two 
surveys emphasize different things. For example, we treat in detail the differentiabil- 
ity aspects and fine structure of the graph of T, and discuss various generalizations 
and applications in considerable detail. (Hence the length of the last two sections 
of this paper.) Lagarias, on the other hand, focuses on connections of the Takagi 
function with several areas of analysis, including wavelets, complex power series, 
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and dynamical systems. In view of this, we feel that our survey and that of Lagarias 
complement each other quite well. 



1.5 Frequently used notation 

We collect here some notation that will be used regularly throughout this paper. 
First, for definiteness, we let N denote the set of natural numbers, and Z + the set of 
nonnegative integers. Most important is the binary expansion of a point x G [0, 1), 
which we denote by 

oo 

x = 5^2^ = °- £ i £ 2- e n e{0, 1}. 

n=l 

For dyadic rational x (i.e. x of the form x = k/2 m with k G Z + and m G N) we 
choose the representation ending in all zeros. When necessary, to avoid confusion, 
we write e n (x) instead of e n . 

Let I n := I n (x) be the number of ones, and O n := O n (x) the number of zeros, 
among the first n binary digits of x, and let D n := D n (x) := O n (x) — I n (x). Thus, 
we have 



k=i 



and 



If the limit 



D n = J2(l-2e k ) = J2(- 1 Y k - 

k=l k=l 
1 n 

di(x) := lim - S^e k , (1.2) 



n— too n 

k=l 



exists, we call di(x) the density (or long-run frequency) of the digit "1" in the binary 
expansion of x. In that case, the number 

d (x) := 1 — di(x) 

is the density of the digit "0" . 

The orthogonal projections onto the x- and y-axes will be denoted by n x and 7Ty, 
respectively. 

By % a we will denote a- dimensional Hausdorff measure, and by dim^ A, the 
Hausdorff dimension of a set A. For a function /, dim H (f) denotes the Hausdorff 
dimension of the graph of /. 
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2 Analytic properties 



In this section we focus on analytic aspects of the Takagi function, including infinite 
derivatives, Holder continuity and slow points. We begin with a short proof of the 
function ' s nowhere-different iability. 

2.1 Derivatives, or lack thereof 

Takagi himself gave a proof of the fact that T has nowhere a finite derivative [75] , 
as did Hildebrandt [33J and de Rham |62j. Van der Waerden's simple proof for the 
base 10 case, however, does not immediately transfer to the case of base 2. While all 
published proofs of non-differentiability follow more or less the same logic, the one 
by Billingsley [IT] is arguably the most natural, and that is the one we present here. 

Theorem 2.1 (Takagi). The Takagi function T does not possess a finite derivative 
at any point. 

Proof. (Billingsley) Put 0fc(x) = 2~ k <p(2 k x) for k — 0,1, ... . Fix a point x, and, for 
each n e N, let u n and v n be dyadic rationals of order n with v n — u n = 2~ n and 
u n < x < v n . Then 

T(v n ) - T(u n ) 
v n u n 

since 4>k{u n ) = (f>k(v n ) = for all k > n. But for k < n, <fik is linear on [u n , v n ] with 
slope 4>^(x), the right-hand derivative of 0& at x. Thus, 

T(v n ) - T(u n ) 

Since <fi~^{x) = ±1 for each k, this last sum cannot converge to a finite limit. Hence, 
T does not have a finite derivative at x. □ 

Billingsley's argument was modified by Cater [22J to show that T does not have 
a finite one-sided derivative anywhere. The above proof makes it plausible, however, 
that there exist points with T'(x) = ±oo. An Editor's note affixed to Hildebrandt's 
paper asked readers to characterize the set of such points. The call was answered 
three years later by Begle and Ayres [15], who claimed that T'{x) = oo if D n (x) — > oo, 
and T'(x) = — oo if D n (x) — > —oo. This is certainly believable at first sight: If we 



E 

k=0 



4>k{Vn) ~ 4>k{u r , 



U, 



n—1 
k=0 
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agree that for dyadic rational points we choose the binary expansion ending in all 
zeros, then the last equation in the above proof can be written as 



nVn)-T(u n ) =Dn{x) 



Ur 



Convergence of the above slopes to ±00 is necessary in order that T'(x) = ±00, 
but it is of course, a priori, not sufficient. (In fact, there are examples of nowhere 
differentiable functions for which the dyadic derivative exists almost everywhere [T2~| 
Example 3.3].) Begle and Ayres assumed that for fixed n, the slope D n (x) cannot 
jump by more than ±2 as one moves from one dyadic interval into the next. But 
this is already false for n = 4, as D±(x) = —2 for 7/16 < x < 1/2, and D±(x) = 2 for 
1/2 < x < 9/16. 

The paper by Begle and Ayres appears to have been forgotten soon after its pub- 
lication, as was Hildebrandt's note. In any case, there is no evidence in the literature 
that the mistake was ever noticed - until a few years ago, that is. We learned of Begle 
and Ayres' work from an historical survey by Prof. H. Okamoto, written in Japanese. 
Knowing that Prof. M. Kriippel [15] had recently written about the improper deriva- 
tives of the Takagi function, we sent him a courtesy notification. Kriippel's stunning 
reply was that, while he did not know about Begle and Ayres, the result could not 
possibly be true, as his own paper contained a counterexample! (Curiously, Lemma 
7.4 of Anderson and Pitt [12] implies the same incorrect statement. In their case, 
however, the culprit appears to be a typographical error.) 

We present Kriippel's example here in somewhat simplified form. Let x = 
X^^li 2 _a ™, where a n = 4 n . Then certainly D n (x) — > 00. A well-known formula 
for T(x) at dyadic rational points is 

, \ , fe-i 
k 



T . 

2 r 



^J>-2 Si ), (2.2) 



j=0 



where Sj is the number of ones in the binary representation of the integer j. (There 
are several ways to derive this formula; see Section HI) For given m, let k be the inte- 
ger such that k/2 m < x < (k+l)/2 m . Then the secant slopes over the dyadic intervals 
[k/2 m , (k + l)/2 m ] containing x indeed tend to +00 in view of ( 12. II) . However, if we 
put m = a n+ i — 1, then s k = n, s k -\ = n + a n+1 -a n -2, and s k -2 = n + a n+1 — a n -3. 
Thus, (E2} yields 



T ,k±l\_ T /k-2 



= 3m — 2s k — 2sfc_i — 2sk- 

= 4a n - a n+ i -6n + 7 
— > —00, 
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as n — > oo. Since the intervals [(k — 2)/2 m , (k + l)/2 m ] also contain x, it follows that 
T cannot have an infinite derivative at x. 

Intrigued by these developments, the present authors and Prof. Kriippel inde- 
pendently set out to find the correct answer. The result: 

Theorem 2.2 (Allaart and Kawamura, Kriippel). Let x G (0, 1) be non-dyadic, and 
write 

oo 

x = ^V a ", 

n=l 

where {a n } is a strictly increasing sequence of positive integers. Then T'(x) = oo if 
and only if 

a n+ i - 2a n + 2n- log 2 (a n+ i - a n ) ->■ -oo. (2.3) 

By the symmetry of the Takagi function, T'(x) = — oo if and only if T'(l — 
x) = oo. It is easy to see from the definition (11.11) that if x is a dyadic rational, 
then T'Ax) = +oo and T'_(x) = — oo, where T' + and T'_ denote the right- and left- 
hand derivatives of T, respectively. Combined with these facts, Theorem 12.21 gives a 
complete characterization of the infinite derivatives of T. 

In fact, the condition ( 12. 31) is necessary in order that T'_(x) = +oo. For T' + (x) = 
+oo, it is sufficient that a n — 2n — > oo, and this can be seen to be equivalent 
to the Begle and Ayres condition that D n (x) — > oo. Allaart and Kawamura [H] 
give several examples illustrating the condition (I2.3p . For instance, the condition 
holds for a n = 3n; for any increasing polynomial of degree 2 or higher; and for any 
exponential sequence a n = \_a n \ with 1 < a < 2. On the other hand, it fails whenever 
lim sup^^ a n+ i/a n > 2. The logarithmic term in (12.31) is sometimes a difference 
maker: The sequence a n = 2" does not satisfy (12. 3ft : neither does a n = 2 n + n. But 
a n = 2 n + (1 + e)n satisfies (12T3I) for any e > 0. 

Theorem 12.21 implies that, if the density d\(x) exists and lies strictly between 
and 1/2, then T\x) = oo. By symmetry, T'(x) = — oo if 1/2 < di(x) < 1. As a 
result, the sets {x : T\x) = oo} and {x : T\x) = — oo} have Hausdorff dimension 1. 

2.2 Continuity properties 

Since T is nowhere different iable, it is certainly not Lipschitz. However, Shidfar and 
Sabetfakhri [68J showed that T is Holder continuous of any order a < 1. That is, for 
each < a < 1, there is a constant C a such that 

\T(x)-T(y)\<C a \x-y\ a , 

for all x and y in [0,1]. This result prompted an interesting question. A theorem 
of Marcinkiewicz says that for every Lipschitz function / on [0, 1] there is a C 1 
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function g which agrees with / outside a set of arbitrarily small measure, and Brown 
and Koslowski [19J wondered if the Lipschitz requirement in this theorem can be 
replaced with the weaker condition that / be Holder continuous of any order a < 1. 
They show that this is not so: the Takagi function provides a counterexample, since 
for any set M of positive Lebesgue measure, the set of difference quotients 

T(x) — T(v) 

— ^ — : x E M, y e M and x 4- y 

x-y 

is unbounded. 

While the Takagi function is not Lipschitz, it does satisfy a local Lipschitz condi- 
tion at each "slow point". Abbott et al. [1] call a point x in [0, 1] a slow point with 
constant K (K a positive integer) if |D n (a;)| < K for every n. They show that there 
is a uniform bound P = P(K) > such that for each slow point x with constant K 
and for each y e [0,1], \T(y) — T(x)\ < P\y — x\. They also manage to compute the 
Hausdorff dimension of the set of slow points with constant K; it is 1 + log 2 r, where 
r = cos(7r/(2(i^ + 1))). The paper by Abbott et al. also contains results for more 
general functions based on the tent map; we will return to it in Section [7J 

The result of Shidfar and Sabetfakhri was sharpened by Anderson and Pitt [12] , 
who showed that not only T, but every function / in the so-called Zygmund space 

is Lipschitz of order 9{y) = ylog(l/y); that is to say, there is a constant M such 
that, for all x and y with y > sufficiently small, 

\f(x + y)-f(x)\<My\og(l/y). 

From this, one can deduce that the graph of T has Hausdorff dimension 1, a result 
first obtained by Mauldin and Williams [59] on which we will elaborate in the next 
section. 

More precise estimates on the oscillations of T were obtained by Kono |44j . He 
describes both the "worst-case" behavior and the "typical" size (in the Lebesgue 
sense) of the oscillations. Let 

a u (h) = \og 2 (l/h) and a t (h) = y / \og 2 (l/h), h > 0. 

Theorem 2.3 (Kono 1987). The oscillations of the Takagi function satisfy 

r T(x) - T(y) T(x)-T(y) 

lim SUp r rr- = 1 = — llHl 1IU 



>0 (x - y)a u (\x - y\) \x-v\-m (x - y)a u (\x - y\)' 



The extremal case of the above theorem is rare - at most points x the oscillations 
are of a smaller order. 
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Theorem 2.4 (Kono 1987). For almost every x G [0, 1], we have 

T{x + h)-T{x) 1 . „ T(x + h)-T{x) 

lim sup j^^^=^^^= = 1 = — lim mi 



h->o ha l (\h\)^2\og\oga l (\h\) ha ^\ h \) ^21oglog^(|/i|)' 

Kono proves the last theorem by developing T(x) in terms of Rademacher func- 
tions and applying the law of the iterated logarithm. Note that for fixed x G [0, 1], 
Theorem 12.31 implies 

- 1 < liminf T{X + h) ~ T(X) < limsup T ^ + h)-T{x) 

"W h\og 2 {l/\h\) -T-!o P Mog 2 (l/H) - {2A) 



Within these bounds, various kinds of behavior are possible. Kriippel [45] shows that 
if x is dyadic rational, then 

k-0 |/»|log 2 (l/|/»|) 
Allaart and Kawamura [8] characterize for which points x the limit 

*-* ftl0g 2 (l/|/!|) 1 ' 

exists. This requires the following definition. 

Definition 2.5. Let x G [0, 1] be non-dyadic, and let {a n } and {b n } be the (unique) 
strictly increasing sequences of positive integers such that 

oo oo 

bn 



^2—, l-x = ^2 



x 

n=l n=l 

We say x is density-regular if (ii(x) exists and one of the following holds: 

(a) < di(x) < 1; or 

(b) di(x) = and a n+1 /a n — > 1; or 

(c) g?i(x) = 1 and b n+ i/b n — > 1. 

Theorem 2.6 (Allaart and Kawamura 2010). Let x be non-dyadic. The limit in 
( 12. 5 p exists if and only if x is density-regular, in which case the limit is equal to 
d (x) — di(x). 

One can also consider the probability distribution of T(x + h) — T(x) for small 
h when x is chosen at random from [0, 1]. Gamkrelidze adapts Kono's approach 
to give the following Central Limit Theorem-type result: 

toA ( L : r <^'»- r W < y\) = "= f" e-***. 

w ^\ ftvio g2 (i/A) J; vsf 

where A denotes Lebesgue measure. 
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3 Graphical properties 



Figure [U shows the graph of the Takagi function, restricted to the interval [0, 1]. A 
number of features quickly jump out. The graph is symmetric about the line x = 1/2, 
and it has cusps and local minima at the dyadic rational points. An important aspect 
of the graph is that it has two 1/4-scale copies of itself at its top; this is due to the 
self-affine nature of T, and leads quickly to the fact, observed by many an author, 
that the absolute maximum of T is attained at uncountably many points. More 
specifically, we have 

Theorem 3.1 (Kahane 1959). The maximum value ofT is 2/3. The set Ai of points 
where T attains the maximum value is a perfect set of Hausdorff dimension 1/2, and 
consists of all the points x with binary expansion satisfying S2 n -i + ^2n = 1 f or e-CLch 
n. 

Proof. The simplest way to see this is to rewrite (jl.ip as 

oo 1 

T(x) = J2^M^), 

71=0 

where <px is the "table-top" function (pi(x) = 4>(x) + (l/2)0(2x). Let 

n— 1 -. 

T n (x):=J2 ¥ <P(2 k x)> (3-1) 

k=0 

and note that T 2n (:r) = Xlfc=o 4~ fc 0i(4 fc x), for n G N. Figure |2] shows the graphs of 
T2 and T4. One sees by induction that the maximum value of T2 n is 

111 1 /r " _L 



and hence, 



2 2 i f 2 V4 



00 1 /1\ 

M := max{T(x) : x E [0, 1]} = £ - f - J 



2 
3' 



If x G [0, 1], then T(x) achieves this maximum value of 2/3 if and only if x lies in 
the middle half of each quarternary interval to which it belongs - in other words, 
if the quarternary expansion of x contains only l's and 2's. In terms of the binary 
expansion x = Y^=o e n/'^ n of x, it means that e 2n -i + ^2n = 1 for each n. Thus, 
the set M. := {x G [0, 1] : T(x) = M} is a Cantor-like set constructed by removing 
at each step the two outside fourths of each remaining quarternary interval. As a 
result, dim H M = log2/log4 = 1/2. □ 
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Figure 2: The functions T 2 (x) = 4>x{x) (left) and T 4 (x) = (f>i(x) + (l/4)0i(4x) (right) 



(Kahane did not show that the dimension of Ai is 1/2, but he easily could have: 
the technique for calculating the dimensions of generalized Cantor sets was by 1959 
well established.) 

3.1 Humps and local extrema 

The appearance of smaller-scale similar copies of the graph of T is not limited to the 
central part of the graph; it happens everywhere. We introduce two definitions and 
a lemma to make this precise. Let 

Q T ■= {(x,T(x)) : < x < 1} 

denote the graph of T over the unit interval [0,1]. The term 'balanced' in the 
following definition is taken from Lagarias and Maddock |50j. 

Definition 3.2. A dyadic rational of the form x = O.E1E2 ■ ■ ■ £im is called balanced if 
D2m{%) = 0. If there are exactly n indices 1 < j < 2m such that Dj(x) = 0, we say x 
is a balanced dyadic rational of generation n. By convention, we consider x = to be 
a balanced dyadic rational of generation 0. The set of all balanced dyadic rationals 
is denoted by B. For each n G Z + , the set of balanced dyadic rationals of generation 
n is denoted by B n . Thus, B = U^Lo^™- 

Lemma 3.3. Let m 6 N, and let x = k/2 2m = O.E1E2 ■ ■ .E2m be a balanced dyadic 
rational. Then for x E [k/2 2m , (k + l)/2 2m ] we have 

T(x)=T(x ) + ^T {2 2m (x-x )). 
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Order 1 Generation 2 




Figure 3: The "humps" if (1/4), if (5/8) and if (7/8), enclosed in rectangles from left 
to right. Note that in binary, 1/4 = 0.01, 5/8 = 0.1010, and 7/8 = 0.111000. Only 
the first of these, if(l/4), is a leading hump. 

In other words, the part of the graph ofT above the interval [k/2 2m , (k + l)/2 2m ] is a 
similar copy of the full graph Qt, reduced by a factor l/2 2m and shifted up by T(x )- 

Proof. This follows immediately from the definition (II. ip . since the slope of T 2m over 
the interval [k/2 2m , (k + l)/2 2m ] is equal to D 2m (x ) = 0, and T(x ) = T 2m (x ). □ 

Definition 3.4. For a balanced dyadic rational Xq = k/2 2m as in Lemma |3.3| let 
H(xq) denote the portion of the graph of T restricted to the interval [k/2 2m , (k + 
l)/2 2m ]. By Lemma |3.3| H(xq) is a similar copy of the full graph Qt] we call it 
a hump. Its height is §(|) , and we call m its order. By the generation of the 
hump H(xo) we mean the generation of the balanced dyadic rational Xq. A hump 
of generation 1 will be called a first-generation hump. By convention, the graph Qt 
itself is a hump of generation 0. If Dj(xq) > for every j < 2m, we call H(xq) a 
leading hump. See Figure [3] for an illustration of these concepts. 

The Takagi function T takes on a local maximum value at a point x = Y2^=o £ n/2 n 
precisely when the point (x,T(x)) is located at the top of some hump. This is the 
case if and only if for some m e N, 

£! + ••• + S2m = m , and e 2n -i + £ 2n = 1 for each n > m. 

The first part of the above condition ensures that (x,T(x)) lies on a hump of order 
m; the second part implies that it lies at the top of that hump. In particular, the 
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points of local maximum of T lie dense in [0, 1]. This result too is due to Kahane 
|34j . On the other hand, since T' + (x) = oo and T'_(x) = — oo at each dyadic x, T has 
a local minimum value at every dyadic rational point x. Kahane shows that there 
are no local minima at non-dyadic points. 

3.2 Humps and Hausdorff measure 

It is often necessary to count the humps of a given order and/or generation, and this 
counting involves the Catalan numbers 

Cn--=—r( , n = 0,l,2,.... 
n+l\n J 

Lemma 3.5. Let m G N. 

(i) There are ( 2 ^) humps of order m. 

(ii) There are C m leading humps of order m. 

(Hi) There are 2C m _i first- generation humps of order m. 

This lemma is extremely helpful in the study of the level sets of T (see Section [6]). 
Another use is the following. Mauldin and Williams [59] first showed that the graph 
of T has Hausdorff dimension one, but remarked that they did not know whether 
it has a-finite linear measure. Anderson and Pitt [12] showed that the answer is 
affirmative, not only for the Takagi function but for a much wider class of functions 
(the so-called Zygmund space A^). Odani [60j explicitly decomposed the graph of T 
into countably many sets of finite linear measure, as follows. 

Let S denote the set of points (x, y) on which belong to humps of infinitely 
many generations, and for n = 0,1,2,..., let E n denote the set of points which 
belong to a hump of generation n, but not to a hump of generation n + 1. Then 

g T = s U E U E ± U E 2 U . . . . 

Since Eq is the graph of T with all the first-generation humps removed, it is intuitively 
clear (and can be made rigorous) that the restriction of T to irx(Eo) is monotone 
increasing on [0,1/2], and monotone decreasing on [1/2,1]. Hence, Eq has finite 
linear measure. Next, E\ consists of countably many copies of Eq, one inside each 
first-generation hump. For each m G N, there are 2C m _i copies with contraction 
ratio l/4 m by Lemma [3.51 Thus, 

00 /i\ m 1 00 /i\ n 

h\e 1 ) = Y, 2C ^(j) « 1 (^) = 2E C » 4 n\E ) = 'H 1 (E ), 

m=l ^ ' n=0 ^ ' 
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where we have used the well-known fact that 'Y^ =a C n (l/ A) n = 2. Inductively, 
this argument can be continued to show that 'H 1 (i? n ) < oo for each n. It remains 
to verify that ft 1 (5) < oo. Order the first-generation humps Hi,H>2,... in some 
arbitrary manner, and for i G N, let <3>j be the similarity map which maps Qt onto 
Hi. Then it is easy to check that 

oo 

s = {Jms). 

i=i 

The open set condition is satisfied (take (0, 1) x M, say). As above, the contraction 
ratios of the $j sum to 1, so Moran's equation gives dim# S = 1. An easy exercise 
(it is clear which coverings to use) shows that "H^S 1 ) < oo. Thus, the graph of T has 
cr-finite linear measure. (In fact, H 1 ^) > as well, since n x (S) has full Lebesgue 
measure.) 

Essentially the same construction is given by Buczolich [21], Theorem 9]. He 
shows additionally that the set S is an "irregular" 1-set, meaning that it intersects 
every continuously differentiable curve in a set of K 1 -measure zero. In [201 Theorem 
10], Buczolich shows also that the Takagi function is "micro self-similar" , in the sense 
that the graph of T itself is a micro tangent set of T at almost every point x G [0, 1]. 

4 Alternative representations of T(x) 

While (11 .ip is arguably the simplest and certainly the most common expression for 
T(x), many other representations occur in the literature, and most have some unique 
advantage in proving certain things about the Takagi function or its generalizations. 

1. Dynamical systems view. To begin, put ip(x) := 2<fi(x) for x G [0, 1], and note 
that ip is a special case of a "tent map", which maps [0, 1] onto itself. It is easy to 
see that we can write (II. ip as 

oo 

T(x) = J2^ in) (x)i are [0,1], (4.1) 

n=l 

where ip^ denotes n-fold iteration of ip. Since Yl^Li l/2 n = 1, (14. ip represents T(x) 
as a weighted average of the iterates of x under the chaotic dynamical system ijj. 

2. Takagi's definition. Most authors define the Takagi function by either (11.11) 
or (14. ip . but it should be pointed out that Takagi himself defined T(x) differently. 
For n G N, let a n denote the number of binary digits among {ei, . . . , e„_i} that are 
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different from e n . In other words, a n = O n {x) if e n = 1, and a n = I n (x) if e n = 0. 
Takagi defined T(x) by 

oo 

n*) = EF- (4 - 2) 

n=l 

To see that (II. ip and (14 .2D are equivalent, express <p(x) in terms of the binary ex- 
pansion of x by 

««) = f; * (1 -* ) ; (1 -*K 

k=i 

This generalizes to 

0(2"x) _ g n +j(l ~ gnj-l) + (1 ~ gn+j) g n+l / . „x 

We can similarly write a n = e n O„(a;) + (1 — e n )I n (x). Inserting (14. 3 p into (II. ip and 
interchanging summations it is now easy to obtain (I4.2p . (This is essentially the 
proof given by Lagarias and Maddock [501 Lemma 2.1].) 

Kono [B] , and later Gamkrelidze [29] , used a form similar to (14. 2 p (expressing a n 
in terms of the Rademacher functions X n (x) = (— l) e ") to investigate probabilistic 
properties of the graph of T. 

3. Tambs-Lyche's definition. In 1939, Tambs-Lyche [76J gave the following ex- 
pression for T{x). Write 

oo 

*=!>-<., 

3=1 

where is a strictly increasing sequence of integers (the sum being finite if x is 
dyadic). Then 

T(X)= ^2^ZA. (4 . 4) 

i=i 

This formula is useful for approximating solutions to the equation T(x) = y, as 
shown in P, Section 4.2]. 

Tambs-Lyche actually defined his function by the summation in the right hand 
side of (I4.4p . and stated without proof that it is equivalent to (II. ip . Tambs-Lyche's 
formula too has been rediscovered many times. In her thesis and subsequent publica- 
tion [38], Kawamura used a relationship with Lebesgue's singular function (explained 
in the next section) to obtain the expression 

m = f E „(*) °' w - 2 1- M+2 = £ e,!,) - 1 "''-" , (4.5) 

n=l n=l 
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which is clearly equivalent to (14. 4p . De Amo and Fernandez- Sanchez [IT] derive (14. 4p 
explicitly from (II. ip . while Kuroda [H] deduces it directly from Takagi's definition 
( 14. 2p . Here we give a short proof using ( 11. ip . Recall the definition of T n from ( 13. ip . 
and note that T n is piecewise linear with slope D n (x) at all points not of the form 
j/2 n . Moreover, T(j/2 n ) = T n (j/2 n ). Thus, if < I < m and j G Z + , we have 

i , 1 \ ^ Z' i A i n ^ i A m - 2sj 



T ^ + _ _T 4 = — D m 4 = , (4-6) 

V2' 2 m / V 2 ) 2 m \2 l ) 2 m 

where Sj is the number of l's in the binary expansion of j. This immediately gives 
T(l/2 m ) = m/2 m , and a straightforward induction argument yields 




n 



*;-2(j 



2'i 



for all n G N and integers 1 < Zi < I2 < ■ ■ • < l n - The continuity of T gives (14.41) . 

As a by-product of (14. 6 j) (putting Z = m and summing over j), we obtain the 
formula given by Kruppel [45J: 

r f4) = iE^- 2s i)' ( 4 - 7 ) 



2 m ; 2 

7 i=o 



which was used in Section 12.11 



4. Random walk definition. Lagarias and Maddock [50j Section 2] use (14.21) to 
express T(x) in terms of the sequence {D n (x)} as follows: 



r w = i-iE(-ir-«^. (4. 



- 4 „ 

This formula is useful in the study of level sets, because one easily infers from it the 
following important fact. 

Lemma 4.1. If \D n (x) \ = \D n (x')\ for every n, then T(x) = T(x'). 

Note that {D n (x)} n is a symmetric simple random walk when x is chosen at 
random in [0, 1]. Thus, (14.81) expresses T as a functional of a random walk. 

5. Fourier series. While the definition (11.11) gives T(x) directly as a Schauder 
series, it is also relatively easy to compute the Fourier series for T(x). Hata and 
Yamaguti [31] point out that 

1 2 ^-^ cos 2nkx 

fcgN, k odd 
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and this results in the Fourier series 

1 2 °° 
T(x) = - A m cos 2imix, 

m=l 

where A m = (2 n k 2 )~ 1 if m = 2 n k, with k odd. The Fourier coefficients A m satisfy 
1/m 2 < A m < 1/m. In particular, the Fourier series of T(x) is non-lacunary, in 
contrast to the Weierstrass function which is defined as a lacunary Fourier series. 



5 Functional and difference equations 

De Rham [62] was the first to point out that the Takagi function on [0, 1] satisfies 
the functional equation 

n= f(l/2)/(2s)+x, 0<x<l/2, 
T[X) \(l/2)/(2x-l) + (l-x), 1/2<x<1. { ' } 

Kairies, Darsow and Frank [36] observed (15 .ip and proved the following results: 

1. Any function / : [0, 1] — > M satisfying ( 15. ip is nowhere different iable, and 
coincides with T on the dyadic rationals. 

2. If / : [0, 1] ->• R satisfies and is bounded, then / = T. 

3. A recursive relation for the moments M n = x n f(x) dx of any function / 
satisfying ( 15. ip is given by 



1 

(n + lX"n + 2) ' 2(2 r 



k=0 



(The paper [36J was inadvertently printed before the page proofs were received; 
a list of corrections is given in [24J.) Later, extending the results of [36], Kairies [35] 
gave a list of seven functional equations satisfied by T(x), and investigated which 
subsets of these equations imply that a bounded function / satisfying them must in 
fact be the Takagi function. 



In 1984, Hata and Yamaguti [31] started a new direction by regarding the Takagi 
function and related functions as solutions of discrete boundary value problems. This 
was quite natural, since ( II .ip gives T(x) directly as a Schauder series, and from the 
Schauder expansion of a function one quickly obtains an infinite system of difference 
equations which the function satisfies. Using these difference equations, Hata and 
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Yamaguti showed that the Takagi function is closely related to another special func- 
tion: Lebesgue's singular function. Kawamura [38] later adopted their approach and 
found a close relationship between other nowhere differentiable functions, singular 
functions, and self-similar sets in the plane. 

First, we briefly recall Schauder expansions. Whereas a function's Fourier expan- 
sion uses trigometric functions, the Schauder expansion uses the "tent" functions 



SnJ.(x) 



|20(2 n x), if^<x< 
) 0, otherwise, 



i+l 
2 n 



for < i < 2 n — 1 and n G Z + . Thus, the graph of S n ^ is the regular isosceles 
triangle of unit height whose base is the interval [i/2 n , (i + l)/2 n ]. 

It is well known that every continuous function / : [0, 1] — > R which vanishes at 
and 1 has a unique Schauder expansion of the form 



oo 2 n -l 



f{ X ) = ^2 a n,i S n,i(x), (5.2) 



n=0 i=0 

where 



2 n+i j 2 I \2 n J \ 2 n 
Applying this to the Takagi function, we immediately obtain 

Theorem 5.1 (Hata- Yamaguti, 1983). The Takagi function T(x) is the unique con- 
tinuous solution of the discrete boundary value problem 

-(w)-H r (^) +r (^)}^ <M> 

where < i < 2 n — 1, n £ Z +; and the boundary conditions are T(0) = T(l) = 0. 

Next, recall Lebesgue's singular function. Imagine flipping an unfair coin with 
probability r 6 (0,1) of heads and probability 1 — r of tails. Note that r ^ 1/2. 
Let the binary expansion of t G [0,1]: t = Y^ = i^>n/2 n be determined by flipping 
the coin infinitely many times. More precisely, u n = if the n-th toss is heads and 
uj n = 1 if it is tails. We define Lebesgue's singular function L r (x) as the distribution 
function of t: 

L r (x) := Prob{t <x}, < x < 1. 

With the function L r is associated a probability measure /i r on [0, 1], called the bino- 
mial measure, under which the binary digits of a number t G [0, 1] are independent, 
taking the values and 1 with probabilities r and 1 — r, respectively. 
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It is well-known that L r (x) is strictly increasing, but its derivative is zero almost 
everywhere. De Rham [63j showed that L r (x) is the unique continuous solution of 
the functional equation 

L (x) = l rLr{2x) > 0<x<I, 
rV ; \(l-r)L r (2x-l)+r, \ < x < 1. V ' ; 

Hata and Yamaguti showed that L r (x) is also the unique continuous solution of the 
following discrete boundary value problem: 

2z + 1\ , /i\ fi + 1\ , . 

'l-r)L r - )+rL r — - , (5.5) 



2«+i / v \ 2 n ) V 2" 

where < % < 2 n — 1 and n e Z + . The boundary conditions are L r (0) = and 
L r (l) = 1. From (15.31) and (15. 5p . they proved the important and useful relationship 



ld_ 

2d^' 



-—L r (x) 



= T(x). (5.6) 

r=l/2 



This identity can also be obtained from the following expression for L r (x), due to 
Lomnicki and Ulam [53] : 

oo 

L r (x) = _^V;e n r n - J »(l-r) J ». (5.7) 
1 — r z — ' 

n=l 

Differentiating this with respect to r and setting r = 1/2 gives the right hand side 
of (14. 5j , and hence we have (I5.6p . (In [38J the reverse approach is taken, and (I4.5P is 
derived from (15. 7p using (I5.6p .) We note that (I5.6P also leads to a very short proof 
of (14 .7p : It is easy to see that 

fe-i 



j=0 

where Sj is the number of l's in the binary expansion of j. Differentiating gives (14. 7ft . 

The functional equations (15. ip and ( 15. 4 p are both special cases of the general 
family of functional equations studied by de Rham [63]. De Rham considers the 
Takagi function and Lebesgue's singular function in separate papers, and does not 
appear to have noticed the relationship ( 15. 6ft . 
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5.1 Evaluating T(x) for rational x 



One particular use of the functional equation (15.11) is to the exact evaluation of T(x) 
for rational x. As noted by Knuth (JH P- 32, p. 103], T(x) is rational whenever x 
is, and by applying (15. 11) repeatedly one obtains a system of linear equations which 
is easily solved. We give the details here, and also examine the number of iterations 
required to compute T(x). 

Let x = p/q, where p, g G N with gcd(p,q) = 1. Assume first that p < q/2. 
Then, by (15. lft and the symmetry of T, we can write T(p/q) = \T{jp' /q) + (p/q), 
where p' = min{2p, q — 2p}. If q is even, the fraction p'/q simplifies. If q is odd, 
then gcd(p', q) = 1 again. These ideas lead to the following two-stage algorithm for 
evaluating T(p/q): 

Step 1. Let q = 2 m q', with q' odd, and assume gcd(p,q) = 1. Put go : = Q, 
p := min{p, q - p}, and g,- := qj-i/2, pj := min{pj_i, qj - Pj-i}, j = 1, • • • , m. Let 
p' '■— Pm- Then, after applying the functional equation m times, putting the results 
together and simplifying, we obtain 



So it remains to compute T(p/q) for odd q. 

Step 2. Let q be odd and gcd(p,q) = 1. Put p := min{p, q — p}, and pj := 
min{2p i _i,g - 2p i _ 1 }, j = 1,2, ... . Note that Tfa/q) = ±T(p j+1 /q) + (pj/q) for 
each j > 0. Since pj = ±2p J -_ 1 (mod g), we have pj = ±2 J p (mod g) for each j, 
so there will be some positive integer j such that pj = po- The smallest such j is 
the number k := min{j G N : 2 J ; = ±1 (mod g)}. Such a j always exists by Euler's 
theorem, and k < <p(q), where the Euler function <p(q) denotes the number of integers 
in {1, 2, . . . , q — 1} relatively prime to q. Let ord 9 (2) denote the order of 2 in the 
group of units of Z/gZ. That is, ord g (2) is the smallest positive integer n such that 
2™ = 1 (mod q). It is an elementary exercise in number theory to show that 



It now takes precisely k iterations of the functional equation to express T(p/q) in 
terms of itself. Solving for T(p/q) and simplifying, we eventually obtain 





\ ord 9 (2), if q\2 j + 1 for some j G N 
ord g (2), otherwise. 
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The inverse problem is also interesting: given a rational y in the range of T, 
is there a rational point x G [0, 1] such that T(x) = y? This question is still open. 
Knuth |4T1 p. 103] gives an algorithm which produces solutions of the equation T(x) = 
y for many rational y. This involves starting with an initial value v and walking along 
a particular directed graph, updating v by some fixed arithmetic operation at each 
node. Which node may be visited next depends not only on the graph but also on 
whether the transition will keep the value of v within certain bounds. The algorithm 
terminates when one visits a node for the second time with the same value of v. It is, 
however, not known whether the algorithm always terminates. A different method 
which gives solutions for many (but not all) rational y is given by Allaart (6j Section 
4.2]. 



6 Level sets 

In this section we consider the level sets 

L(y) :={xe [0,1] :T(x)=y}, y E R. 

Of course, L(y) = if y & [0,2/3]. The simplest level set is L(0) = {0,1}. At 
the other extreme, in view of Theorem 13. 1[ we have that L(2/3) is an uncountable 
(Cantor) set of dimension 1/2. In general, L(y) can be finite, countably infinite or 
uncountable, and which of these three possibilities is the most common depends on 
the precise mathematical meaning assigned the word "most". If L(y) is finite, it 
must have an even number of points by the symmetry of the graph of T, but any 
even positive number is possible. If L(y) is uncountable, its Hausdorff dimension can 
be zero or strictly positive, but never more than 1/2. 

Level sets are partitioned into easier to understand pieces called local level sets. 
Each local level set is either finite or a Cantor set, and the members of a local level 
set are easily obtained from one another by certain combinatorial operations ( "block 
flips" ) on their binary expansions. A level set can consist of finitely many, countably 
infinitely many, or uncountably many local level sets. As with the cardinalities of 
the level sets, which of these three is the most common depends on how one defines 
"most" . Many questions about the level sets of T remain open. 

6.1 Finite or infinite? 

The level sets of the Takagi function and related functions were first considered 
by Anderson and Pitt [12] . Their Theorem 7.3, which applies to a large class of 
functions, implies that L(y) is countable for almost every y e [0, |]. This result was 
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improved recently by Buczolich |21j . who focused on the Takagi function itself and 
concluded the following. 

Theorem 6.1 (Buczolich 2008). For almost every ordinate y, L(y) is finite. 

Sketch of proof (Allaart 2011). We sketch a proof here that is slightly different from 
the original proof given by Buczolich, and which uses the notion of humps and leading 
humps defined in Section [31 For the full details, see [5l Section 3] . 
Observe first that Lemma 14.11 immediately implies the following. 

Lemma 6.2. For every hump H there is a leading hump H' of the same order and 
generation as H, such that ity(H) = tty(H'). On the other hand, for every leading 
hump H' there are only finitely many humps H such that tty(H) = tty{H'). 

Next, define a set 

X*:=[0,1]\ |J I(x ). 

In other words, X* is obtained by removing all the dyadic closed intervals above 
which the graph of T has a first-generation hump. The importance of X* is made 
clear by the next lemma, which we state here without proof. 

Lemma 6.3. The Takagi function T maps X* onto [0, |]. Moreover, T is strictly 
increasing on X* R [0, ~). 

Lemmas 16.21 and 16.31 can be used to prove the crucial fact that L(y) is finite 
whenever the horizontal line l y at level y intersects only finitely many leading humps. 
Using this fact, the proof of the theorem can be completed as follows. If y is chosen 
at random from [0, |] and if is a leading hump of order m, the probability that the 
line l y intersects if is (j) m . Letting H' denote the set of all leading humps, this gives 

oo 

P(yeMH)) = Y, C ' 

HeW m=0 

since there are C m leading humps of order m by Lemma [3~5| and C m ~ 4 m / (m 3 / 2 y/ir). 
Thus, by the Borel-Cantelli lemma, the probability that l y intersects infinitely many 
leading humps is zero. Therefore, L(y) is finite with probability 1. □ 

Despite the above result, the average cardinality of the level sets of T is infinite. 
That is, 

/ \L(y)\ = oc. 
Jo 

This was shown by Lagarias and Maddock [51J. An alternative proof based on Lemma 
16.31 is given by Allaart [5] . 
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Whereas the above results are probabilistic in nature, a quite different picture 
emerges when one views the level sets of T from the perspective of Baire category. 
Define the sets 

Soo := {v e [0' f] : L(y) is countably infinite}, 
Soo := {v e [0' §] : L(y) is uncountably infinite}. 

Theorem 6.4 (Allaart 2011). The set S£; has the decomposition = E U M, 
where E is a dense Gs set, and M is a countable set disjoint from E which consists 
exactly of the local maximum ordinate values of T . As a result, the set {y G [0, |] : 
L(y) is countable} is of the first category. 

For the proof and a more explicit description of the set E, see Allaart [2J Sec- 
tion 4]. There it is also shown that does not contain any dyadic rational ordinates 
y. Note that the residual set has Lebesgue measure zero by Theorem 16. II How- 
ever, it has full Hausdorff dimension one; see Lagarias and Maddock |51j . 

The set is rather more difficult to describe. It contains the images of all dyadic 
rational abscissas x in [0,1], so it is dense in [0, |]. But it is not known whether it 
contains more. 

Closely related to the level sets of T is the occupation measure defined by (Pr(A) = 
\({x G [0,1] : T(x) G A}), for Borel sets A C R. Buczolich (2TJ shows that /x T 
is singular with respect to Lebesgue measure. This is witnessed by the set S from 
Section 13^2] It is relatively straightforward to show that = oo when y G 7Ty(S), 

so Theorem 16.11 implies X(tty(S)) = 0. On the other hand, tcx{S) has full measure in 
[0, 1], because almost every x G [0, 1] has the property that D n (x) = for infinitely 
many n. Consequently, Ht{^y (£>)) = X(^x(S)) = 1. 

6.2 Cardinalities of finite level sets 

Since almost all level sets are finite in the Lebesgue sense, it is natural to ask what 
these finite cardinalities can be. By the symmetry of T and the fact that L(T(~)) = 
L(~) is countably infinite, the number of points in each finite level set must be even. 
Allaart [6J shows that, vice versa, every positive even number occurs. In fact, we 
have the following. Let 

S 2n := {y : \L{y)\ = 2n}, neN. 

Theorem 6.5 (Allaart 2011). For each n G N ; 5*2 n is uncountable but nowhere 
dense. 
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It is not known whether S2n has positive Lebesgue measure for each n. The 
argument used in the proof of Theorem 16.51 unfortunately does not give enough to 
prove this stronger statement. As a partial result in this direction, however, Allaart 
|6J shows that A(5 , 2 n ) > whenever n is either a power of 2, or the sum or difference of 
two powers of 2. And for the specific case of £2, fairly tight bounds on its Lebesgue 
measure can be given which show, not surprisingly perhaps, that 2 is the most 
common finite cardinality. 

Theorem 6.6 (Allaart 2011). The Lebesgue measure A(S , 2) 0/S2 satisfies 

T2 < ^ < I 

Since the graph of T has height 2/3, this implies that if an ordinate y is chosen at 
random in the range of T, the probability that L(y) contains exactly two points lies 
between 62.5% and 72.9%. The proof of Theorem 16.61 is based on a simple counting 
argument which uses the fact, from Lemma [331 that the graph of T contains exactly 
C m _i first-generation leading humps. See j6] for the details. 

Which specific ordinates y satisfy = 2? It is clear from the graph that y 

must be less than |. Allaart [6] gives two sufficient conditions, the first condition 
being directly in terms of the binary expansion of y. 

Theorem 6.7 (Allaart 2011). Let < y < | such that y is not a dyadic rational, 
and suppose the binary expansion of y does not contain a string of three consecutive 
0's anywhere after the occurrence of its first 1. Then \L(y) \ = 2. 

Thus, for instance, the set S2 includes the points y = 1/3 = 0.(01)°°, y = 1/7 = 
0.(001)°°, and y = 1/40 = 0.0 5 (1100)°°. 

The second condition, which is neither weaker nor stronger than the first, involves 
the orbit of y under iteration of the map 

y (x)= fo, ify = o 

|4(j/-£), if ±<y<±Fk, A; = 3,4,.... 

Note that ^> maps [0, ~) onto itself. Let ty n denote the nth iterate of with 
^°(y) := y. For n = 0,1,2,..., let k n be that number k > 3 for which k/2 k < 
^ n (y) < (k — l)/2 fc -\ or put k n = 00 if ^ n (y) = 0. 

Theorem 6.8 (Allaart 2011). Let < y < \. If k n+1 < 2k n for every n, then 

\L(y)\ = 2. 
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This gives many more examples. For instance, the condition of the theorem 
obviously holds for the fixed points of which are y* k := k/(3-2 h ~ 2 ), k > 4. Perhaps 
unexpectedly, this shows that infinitely many dyadic rational ordinates belong to 
5*2, the first three being 1/8, 3/2 7 and 1/2 8 . More generally, of course, many of 
the periodic points of \I/ satisfy fc n +i < 2k n and are therefore in S2. For instance, 
y = 1/11 does not satisfy the "no 3 zeros" condition of Theorem 16. 7\ but it has 
(k n ) n > = (7, 6, 5, 5, 6, 5, 5, 4, 4, 4)°°, and since 7 < 2-4, Theorem EH yields 1/11 G S 2 . 

Allaart [6] actually gives a slightly weaker condition than the one given in Theo- 
rem ESI which includes lower order terms, and also gives an accompanying necessary 
condition in terms of the sequence (k n ). Together these conditions cover most cases, 
but still leave a small gap. 

Theorems 16.71 and 16.81 are corollaries to the following, precise but somewhat ab- 
stract, characterization of membership in S^. 

Theorem 6.9 (Allaart 2011). Let < y < \. Then \L(y)\ = 2 if and only if 



6.3 Hausdorff dimension 

Recall from Section [3] that the maximum value of T is 2/3, and L(2/3) is a Cantor set 
of Hausdorff dimension 1/2. One might ask whether there exist any level sets with 
Hausdorff dimension strictly greater than 1/2. This question was first addressed by 
Maddock [55], who proved that the intersection of the graph of T with any line of 
integer slope has Hausdorff dimension at most 0.668. In particular, 0.668 is an upper 
bound for the dimensions of the level sets. Maddock himself conjectured that the 
real maximum is 1/2. The issue was finally settled by de Amo et al. [9]. 

Theorem 6.10 (de Amo et al. 2011). For each ordinate y, the box-counting dimen- 
sion of L(y) is at most 1/2, and hence, dim h L(y) < 1/2. 

The proof, which is surprisingly elementary, makes good use of the self-affinity of 
the graph of T and uses a cleverly devised induction argument. 

A direct consequence of Theorem 16 .11 is that almost all level sets (in the Lebesgue 
sense) have Hausdorff dimension zero. On the other hand, Lagarias and Maddock 
[5T] have shown that the set 



®{ty n {y)) > - for alln>0 



where 




k 



3,4,.... 



{y : dim H L(y) > 0} 
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has full Hausdorff dimension one. This is accomplished by putting a sequence of 
subsets of the range [0, |] in one-to-one bi-Lipschitz correspondence with certain well- 
behaved subsets of the domain [0, 1] whose Hausdorff dimension is easy to calculate 
and gets arbitrarily close to 1. 

It is not known exactly which numbers occur as the Hausdorff dimension of some 
level set of T. 

6.4 Local level sets 

Lagarias and Maddock [50, El] introduce the concept of a local level set of the Takagi 
function. They first define an equivalence relation on [0, 1] by 

x~x' 44 \Dj(x)\ = \Dj(x')\ for each j G N. (6.1) 
The local level set containing x is defined by 

L l ° c := {x 1 : x' ~ x). 

Note that by Lemma 14.11 x ~ x' implies T(x) = T(x'), so each local level set is 
contained in some level set. Lagarias and Maddock point out that each local level 
set is either finite or a Cantor set. Members of the same local level set can be 
obtained from one another by simple operations on their binary expansions, called 
"block flips" in [50]. This works as follows. Let Z(x) = {n > : D n (x) = 0} U {oo}. 
For any two elements k, I G Z(x) with k < I < oo, form the point x' with binary 
expansion x' = Y^=i ^" n£ 'n by setting e' n = e n if n < k or n > /, and e' n = —e n if 
k < n < I. Then |D n (a;')| = |-D n (a;)| for each n, and so x' G L l ° c . Every element of 
L l ° c can be obtained from x by at most countably many operations of this type. 

One of the results in [50] concerns the average number of local level sets contained 
in a level set chosen at random. Let N loc {y) denote the number of local level sets 
contained in L(y). 

Remark 6.11. Lagarias and Maddock [50] define L l ° c slightly differently, effectively 
viewing local level sets as subsets of the Cantor space {0, 1} N . However, this distinc- 
tion does not affect the number of local level sets contained in any level set, which 
is all we are concerned with in this survey. 

Theorem 6.12 (Lagarias and Maddock, 2010). The expected number of local level 
sets contained in a level set L{y) with y chosen at random from [0, |] is |. More 
precisely, 

o /"2/3 o 

m l ° C (y)]--=^J o N l °\y)dy= 6 -. 
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A simpler proof of this theorem is given by Allaart [5]. In that paper, local level 
sets are also examined from the category point of view. The result is in marked 
contrast with the conclusion of Theorem 16.121 Define the sets 

Sco := {y '■ L{y) contains infinitely many different local level sets}, 
S l ^' uc '■= {y ■ L(y) contains uncountably many different local level sets}. 

Theorem 6.13 (Allaart 2011). (i) The set S 1 ^ is residual (co-meager) in [0, |]. 

(ii) The set S l ™' uc is dense in [0, |] ; and intersects any subinterval of [0, |] in a 
continuum. 

6.5 Open problems 

A number of interesting questions about the level sets of the Takagi function re- 
main open. We give a brief selection here, and refer to Lagarias [H] for additional 
problems. 

Problem 6.1. Describe the set of ordinates y with a countably infinite level set. 
Or less ambitiously, determine whether contains any points which are not the 
image of a dyadic rational. 

Problem 6.2. (Knuth |JTJ p. 32, Exer. 83]) Does there exist, for each rational 
ordinate y G [0, |], a rational abscissa x G [0, 1] such that T{x) — yl If x is irrational 
but y = T(x) is rational, must L(y) be uncountable? 

Problem 6.3. Determine the probability distribution of \L(y)\. In other words, 
find X(S2n) for each n G N. This could be very difficult, but the weaker problem of 
showing that A(S , 2 n ) > for every n (or finding a counterexample) may be solvable. 

Problem 6.4. Determine the probability distribution of N loc (y), the number of local 
level sets contained in L(y). This too may be difficult. 

Problem 6.5. The set S l ™' uc intersects each subinterval of [0, |] in a continuum. 
Is it residual? Does it have Hausdorff dimension 1? Weaker than the last question, 
does S 1 ^ have Hausdorff dimension 1? 

Problem 6.6. (Lagarias [49J) Determine the dimension spectrum of T. That is, 
determine the function 

f(a) := dim H {y : dim H L(y) > a}. 



30 



7 Generalizations 



7.1 The Takagi-van der Waerden functions 

An immediate generalization of the Takagi function is the sequence of functions 

oo 1 



2,3,.... 

n=0 



Thus, /2 is the Takagi function and /io is van der Waerden's function. Billingsley's 
argument (with only trivial modifications) shows that each f r is nowhere differen- 
tiable. Van der Waerden's original and elegant proof for the case r = 10 works for 
all even r > 4, but not for odd r or for r = 2. 

Each f r is also Holder continuous of any order a < 1. This follows from the more 
general result of Shidfar and Sabetfakhri [69J, but is also easy to prove directly. In 
fact, by analogy with (12 .4p . we have 

' ffr( X + h ) ~ fr(x) fr{x + h) - f r (x) 

—1 < hmmt — — -. — 77— - — < hmsup — — — 77— — < 1. 

" h->0 Mog r (l/|/l|) " fc^o Mog r (l/|/l|) " 

It seems likely that f r has an infinite derivative at many points, but we have not 
found any detailed study of this question in the literature. 

Generalizing Kahane's result jM], Baba [13] determines for each r > 2 the maxi- 
mum value M r of f r and the set of points E r = {x G [0, 1] : f r (x) = M r }. 

Theorem 7.1 (Baba 1984). (i) If r is odd, then E r = {1/2} and M r = r/(2r - 2). 
(ii) Ifr is even, then E r is a Cantor set of dimension 1/2, and M r = r 2 /(2r 2 — 2). 



7.2 The Takagi class 

Another direct generalization of the Takagi function is obtained by replacing the 
factor l/2 n in (11. ip with a general (real) constant c n . This gives functions of the 
form 

00 

f(x) = J2cn<P(2 n x). (7.1) 

n=0 

It is immediately clear that the series converges uniformly, and hence defines a con- 
tinuous function /, when Yl^Lo \ c n\ < °°- Hata and Yamaguti |31] show that this 
condition is also necessary, and call the collection of functions of the form ( 17. ip the 
Takagi class. They note that each member of the Takagi class solves a discrete 
version of the Dirichlet boundary value problem involving the "discrete Laplacian" 

A;, n / := / ^— j + / ~ 2 / y 2 ™+i ) ' 
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in the sense that A itn f = —c n forn > and i = 0, 1, . . . , 2 n — 1, with /(0) = /(l) = 0. 

Beside the Takagi function itself, the Takagi class contains a number of interesting 
examples from the classical literature. For instance, Faber [26] introduced the highly 
lacunary series 

oo 
k=l 

and showed that F has no derivative, finite or infinite, at any point. Moreover, he 
proved that F does not satisfy a Lipschitz condition of any order. Kahane likewise 
constructs lacunary series of the form f(x) = Y^=i Pf2^ k " '4>{2 k " 'x) , and shows how p v 
and k v can be chosen so that the modulus of continuity of / is majorized (respectively 
minorized) by a given function satisfying appropriate conditions. 

Many of the functions in the Takagi class are fractals, in the sense that the 
Hausdorff dimension of their graph is strictly greater than one. Besicovitch and 
Ursell [16] showed that, if / is any function satisfying a Lipschitz condition of order 
5 E (0,1], then 

1 < dim H (f) < 2 - 6, (7.2) 

where dim#(/) denotes the Hausdorff dimension of the graph of /. They show that 
within these bounds any dimension is possible. The implications of their work for 
the Takagi class are collected in the following theorem. 

Theorem 7.2 (Besicovitch and Ursell, 1937). Let f(x) = Y,7=o 2~ San <fr(2 a "x) , where 
< 5 < 1 and a n+ i — a n > A > 0. Then f is Lipschitz of order 5 but of no smaller 
order, and: 

(i) Ifa n+1 /a n -> oo, then dim H (f) = 1. 

(ii) If a n = fi n where /i > 1, then 1 < dim^(/) < 2 — 5. Moreover, for each 
d G (1,2 — 5) there exists \x > 1 such that, if a n = fi n , then dim^(/) = d. 

(Hi) If a n+ i/a n — > 1 but a n+ i — a n — > oo, then dim#(/) =2 — 5. 

Examples of (i), (ii) and (iii) are, respectively: a n = 2™ 2 , a n = 2 n , and a n = n 2 . 
Note that in all three cases the series fl7.ll) is lacunary. Generally speaking, the more 
lacunary the series, the smaller the dimension of the graph is. When the series is 
extremely lacunary as in (i), the fine structure of the graph virtually disappears and 
the function / becomes "almost differentiable" . But the above theorem does not say 
anything about the important (nonlacunary) case a n = n. In other words, it does 
not give the dimension of the functions 

oo 

gs (x) = 2^"0(2 n x), < 5 < 1. (7.3) 

n=0 

This boundary case was addressed more than half a century later by Ledrappier, 
using modern tools that were not yet available to Besicovitch and Ursell. 
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Here too g$ is Lipschitz of order S. Ledrappier showed that typically, g$ attains 
the upper bound in (17. 2p . More precisely, he proved that dim H (g$) = 2 — 5 whenever 
2 5_1 is an Erdos number. A number A G (0, 1) is called an Erdos number if the prob- 
ability distribution of ^^1 \ n s n (the so-called Bernoulli convolution) has Hausdorff 
dimension 1, where {e n } are i.i.d. random variables taking the values 1 and —1 each 
with probability 1/2. Later progress on Bernoulli convolutions due to Solomyak [TO] 
implies that almost every A G (1/2,1) is Erdos, and hence, by Ledrappier's result, 
dim H (gs) = 2 — 5 for almost every 5 G (0, 1). Ledrappier's approach is dynamical, 
viewing the graph of g$ as the repeller for some expanding self-map of [0, 1] x R. The 
deep and difficult proof uses ideas from smooth ergodic theory and a Marstrand-type 
lemma concerning projections of measures. 

Recently, Katzourakis [37J has reported that, with a n = 2vn for a parameter 
v G N, the function / in Theorem 17.21 can be used to construct a "pathological" 
solution to the nonlinear Aronsson partial differential equation and to the Infinity- 
Laplace PDE system. 

For 5 > 1, the function g$ defined by f lT.3 j) is Lipschitz and hence almost every- 
where differentiable. When 1 < 5 < 2, g$ turns out to be the extremal function in a 
certain approximate convexity problem; see Section 18.31 below. 

After the publication of Hata and Yamaguti's paper, Kono jH] investigated the 
Takagi class in greater generality. Perhaps the most striking result, concerning the 
differentiability of /, is the following: 

Theorem 7.3 (Kono 1987). Let f be defined by (17. ip . and put a n := 2 n c n . 

(i) If {a n } G £ 2 , then f is absolutely continuous and hence differentiable almost 
everywhere. 

(ii) If {a n } G" t 2 but lim^oo a n = 0, then f is nondifferentiable at almost every 
point of [0, 1], but f is differentiable on an uncountably large set, and the range of f 
is E. 

(iii) If lim sup^^ \a n \ > 0, then f is nowhere differentiable. 

Kono [H] also considers the oscillations of / (stating more general forms of The- 
orems |23] and E3J) , and proves furthermore that the Takagi class contains only one 
function which is smooth in Zygmund's sense. That is, if 

f(x + h) + f(x -h) -2f(x) = o(h) ash 10 (7.4) 

for all x G (0, 1), then c n = a/4 n for some constant a, and f(x) = 2aa;(l — x). 

A special case of the Takagi class arises when one takes c n = ±l/2 n for all n in 
(17. ip . Precisely, let r = (r , ri, . . . ) be a sequence with r n G { — 1,1} for each n, and 
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Figure 4: The alternating Takagi function 



define 

oo 
n=0 

For example, the alternating Takagi function 

n*) = (7-5) 

n=0 

is of the above form; see Figure HI This gives an uncountably large class of functions 
which are "close" to the Takagi function T in the sense that their partial sums are 
all piecewise linear with integer slopes that change by ±1 at each step. It should 
therefore be no surprise that these functions share a large number of properties with 
T. For instance, Allaart Section 5] shows that many of the results from Section [6] 
concerning level sets hold for arbitrary F r : Almost all level sets of F r are finite, but 
their average cardinality is infinite and the set of ordinates y with uncountably large 
level sets is residual in the range of F r . 

An interesting random version of the Takagi function is obtained by taking the 
components of r to be independent random variables with P(r n = 1) = p and 
P(r n = —1) = 1 — p, where < p < 1. The maximum value M of F r is then 
a random variable, and the set M. := {x G [0, 1) : F(x) = M} is a random set. 
Allaart [2\ determines the probability distributions of M and the size of Ai. If 
p < 1/2, the distribution of M is purely atomic and \A4\ is almost surely finite, 
with range {2 l (2 m — 1) : I £ Z + ,m e N}. (For instance, one can have exactly 24 
maximum points with positive probability.) If p > 1/2, the distribution jj, of M is 
singular continuous, and A4 is a Cantor set with almost-sure Hausdorff dimension 
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(2p — l)/2p. In the latter case, Allaart also determines the Hausdorff dimension and 
multifractal spectrum of \i. 

7.3 The Zygmund spaces A^, X* d and A* dl 

In 1945, Zygmund [82J introduced the class A* of "smooth" functions of period 1, 
i.e. those functions / satisfying (I7.4p uniformly in x, and the wider class A* of 
"quasismooth" functions of period 1, which satisfy (I7.4p with 0(h) replacing o(h). 
Zygmund studied various properties of functions in these classes, and characterized 
them in terms of uniform approximation by polynomials. 

In 1989, Anderson and Pitt [12] introduced the larger classes A^ and A^ of periodic 
functions satisfying a weaker form of smoothness defined in terms of differences over 
dyadic intervals. These classes have simple characterizations in terms of the Schauder 
expansions of their members. Recall that every continuous function / of period 1 
which vanishes at has on [0, 1) a unique Schauder expansion of the form (15.21) . For 
easier comparison with the Takagi class, we will write the Schauder expansion in the 
form 

n=0 

where r n (x) depends only on the first n binary digits of x. More precisely, r n (x) = 
i2n(ei, • • • , e n ), where x = Y^=Q^~ n£ n an d s n G {0, 1}. The classes AjJ and A^ are 
defined as follows: / G A^ if and only if there exists a uniform bound M such that 
|r n (x)| < M for all n and all x\ and / G A^ if and only if r n (x) — > uniformly in x. 

Anderson and Pitt [12] show that A* C A^ and A* C A^. The Takagi function is 
an example of a function which is in A^ but not in A*. On the other hand, as shown 
by Abbott et al. [TJ, the alternating Takagi function T belongs to A*. In general, 
functions in A* can have corners but no cusps, whereas functions in A^ can have 
logarithmic cusps. It is shown in [12] that every member / of A^ is Lipschitz of order 
h\og(l/h). That is, there is a constant C such that, for all0<x<x + /i<l with 
h sufficiently small, 

\f( x + h)-f(x)\<Ch\og(l/h). 

Another result in [12] is that the graph of every / G A^ is of cr-finite linear measure 
(see Section [3]). With regard to level sets L(y) = {x G [0, 1) : f(x) = y}, Anderson 
and Pitt prove that (i) if / G A^, then L(y) is countable for almost every y; and (ii) 
if / G A^, then L(y) is finite for almost every y. It does not appear to be known 
whether the condition f E A* d gives enough regularity to the graph of / in order that 
L(y) be finite for almost all y. 

The specific case of (17. 6p where |r n (x)| is constant in x for each n was studied 
by Allaart [3]. We shall call the collection of such functions the flexible Takagi class. 
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Figure 5: A smooth function in the flexible Takagi class, defined by (17. 7p . 



It is an immediate generalization of the Takagi class in which the individual "tents" 
at each level can point either upward or downward, but all tents within a given 
level have the same amplitude. This guarantees uniformity in the fine structure 
across the domain of /, while allowing for a wide variety of general shapes of the 
graph. Indeed, Allaart [5] manages to extend all of Kono's results to this more 
general setting. Specifically, statements (i)-(iii) of Theorem 17.31 hold when a n = \r n \. 
Whereas the Takagi class contains (up to a multiplicative constant) only one function 
in the Zygmund space A*, the flexible Takagi class contains many. For example, it 
contains the "bell-shaped" curve 



depicted in Figure [5] and obtained by setting r = 1, T\ = 0, and r n = — 2 2 ~ n XiX 2 
for n > 2, where X n (x) = (— l) £n ^ is the nth Rademacher function. See [3j Section 
4] for more examples. Whether all functions in the flexible Takagi class that belong 
to A* must be piecewise quadratic remains open. 

A subcollection of the flexible Takagi class in which \r n \ = 1 for each n was 
studied by Abbott, Anderson and Pitt [1J, who denote this subcollection by h* dl . It 
contains the Takagi function, as well as several other interesting functions that have 
occurred in the literature. For instance, the Gray Takagi function of Kobayashi [42J , 
which plays a role in the analysis of Gray code digital sums (see Section I57T1 below) . 
belongs to X* dl . It has r n = X n for each n. Another example is the function T 3 of 



Kawamura [38J, which has a connection with certain self-similar sets in the complex 

plane. It has r n = X% • • • X n for each n. These functions are shown in Figure |6j 




(7.7) 
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Figure 6: The Gray Takagi function (left) and Kawamura's T 3 (right) 



For a continuous function / on [0, 1), define the dyadic difference quotients D n f 
as follows. If x G [0, 1), let k be the integer such that k/2 n < x < (k + l)/2 n , and 
put 

nj^UmzM.^^) _(*)]. 

(For the Takagi function, D n f(x) = D n (x) as in Section l27Lj see ( 12. ip .) Abbott, 
Anderson and Pitt [1] consider the set Sk of slow points with constant if, that is, 
the set 

Sk ■= S K {f) := {x e [0, 1) : | £>„/(*) I < K for all n}. 
They show that for /e A^, the Hausdorff dimension of 5*x is given by 

dinifl- Sr- = 1 + log 2 ycos 

Moreover, their proof shows that Sk is an s-set, with s being the above dimension. 
In particular, the set Uk=i Sk of all slow points of / has full Hausdorff dimension 1. 
On the other hand, for f e A* dl the set of slow points is a null set, because {D n f} n 
is a simple random walk and hence obeys the law of the iterated logarithm. The 
authors of [TJ are particularly interested in the interplay between slow points and 
local Lipschitz properties of a function. Let L = L(f) denote the set of points x at 
which / satisfies a local Lipschitz condition; that is, those points x for which there 
exists a constant M such that \f(x) — f(y)\ < M\x — y\ for all y ^ x. Abbott, 
Anderson and Pitt show that any /gAJj satisfies a local Lipschitz condition at 
most points of Sk = Sx(f) in the sense that dim^SV fl L) = dimn Sk- For the 
Takagi function and the alternating Takagi function the stronger statement Sk C L 
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2(K + 1) 
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holds, but this is exceptional: If a member / G A* [1 is chosen "at random", then 
'H°'(Sk^L) = with probability 1, where a = dim H Sk- Since H a (SK) > 0, this can 
be interpreted as saying that the "typical" / G satisfies a Lipschitz condition at 
very few of its slow points. 



7.4 The functions of Sekiguchi and Shiota 

Following the important paper of Hata and Yamaguti [31], Sekiguchi and Shiota 
studied further generalizions of the Takagi function. Their first result concerns the 
system of difference equations 

/ - (i - o/ U) - r} a + 1 



2 n+i j \ /J \2 n J \ 2" J (7.8) 

j = 0,l,...,2"-l, n = 0,l,2,..., 

where r G (0, 1) is a constant parameter. Let L r (x) be Lebesgue's singular function, 
and define the generalized Schauder function S r : R — > [0, 1] by 



SJx) 



L r (x)/r, if0<x<l/2 
(l -L r (x))/{1 -r), ifl/2<x<l, 



and S r (x + 1) = S r (x) for all x G R. Sekiguchi and Shiota show that the system 
( 17. 8p has a unique continuous solution / on [0, 1] if and only if l c ™l < °°' ^ n 

which case 

oo 

f(x) = /(0) + (/(l) - f(0))L r (x) +Y,°nS r (2 n x). 

n=0 

The second result of [67] is that L r (x) is an analytic function of r, and a recursive 
construction is given in terms of the functions S r (x) and Rademacher functions of 
the (normalized) kth partial derivative 

Sekiguchi and Shiota use martingale theory (based on the connection of L r (x) with 
unfair coin tossing explained in Section [5]) to prove their results. They show moreover 
that the functions T r ^ satisfy the system of difference equations 

rJ^-(i-r)rJ4W-/ ,+ 1 



2«+i / v V 2 n / V 2 n 



rp T ( 3 

J r,fc-1 I „„ — J-r,h-\ 



ii 
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for k > 1, n > and j = 0, 1, . . . , 2 n — 1, where we set T T $ := L r . Note that by ( 15. 6p . 
Ti/2,i = 2T. 

The special case of the functions T r ^ in which r = 1/2 was investigated further 
by Allaart and Kawamura [?]. In that paper we show that Ti/ 2) „(1 — x) — Ti/2,n(^) 
when n is odd, and Ti/2,n.(l — = — Ti/ 2)n (a;) when n is even. We derive from ( 15.71) 
and (17.91) the representation 




< x < 1. 



This is then used to prove that for each n, Ti/ 2>n is nowhere different iable and Holder 
continuous of order hi log(l//i)) n . That is, there is a constant C n such that, for 
0<x<x + h<l and h sufficiently small, 

\T 1/2in (x + h)-T l/2in (x)\ < C n h(\og(l/h)) n , 

and this bound is the best possible. As a result, the graph of Tu 2in has Hausdorff 
dimension 1. We also determine the global and local extrema of 7i/ 2) 2 and Ty^. The 
sets of points where these functions attain there absolute maximum are shown to be 
Cantor sets of Hausdorff dimension zero, and their members have binary expansions 
that follow a remarkable pattern. By contrast, we conjecture that T\/2,n has only 
finitely many absolute maximum points when n > 4. Regarding the growth rate of 
M n := maxo<3;<iTi/2,n, we conjecture that 

2 n 

M n — , as n ^ oo. (7.10) 

v 7ln 

(Proposition 6.25 in [7] shows that M n grows at least this fast.) 

For general r G (0,1), the functions T rn have not yet been thoroughly inves- 
tigated. However, a new paper by de Amo et al. jTU] announces the surprising 
result that if r ^ 1/2, then T rn is different iable (with vanishing derivative) almost 
everywhere for each n. This is in sharp contrast with the fact that T r n is nowhere 
differentiable if r = 1/2. One might expect a further study of these functions to 
reveal many more interesting properties. 

7.5 Other generalizations and variations 

Many other generalizations and variations of the Takagi function have been studied. 
We briefly mention a few, but the list below is by necessity far from complete. 

Frankl et al. [28] define a generalized Takagi function using arbitrary base 1 + c 
instead of base 2, where c is any positive real number. These functions are used 
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in connection with the Kruskal-Katona theorem in combinatorics; see Section 18.21 
below. The resulting graphs look like the graph of T with the wind blowing in from 
the side. Another way to create skewed versions of the Takagi function is of course to 
take the composition T o h of T with an arbitrary homeomorphism h : [0, 1] — > [0, 1]; 
Kawamura [39J considers the case where h = L~ x and looks at the set of points 
where T oh has a vanishing derivative. Tsujii [78] constructs a Takagi-like function 
of two variables. Sumi [72] observes that Lebesgue's singular function L a (x) can 
be interpreted as the probability of "tending to +oo" in a certain kind of random 
dynamics in the real line. He then extends this notion to random dynamics in the 
complex plane, obtaining a complex version of Lebesgue's singular function and (by 
extending the relationship (15. 6p ) a complex version of the Takagi function. 

For the more general setting where the tent map is replaced with an arbitrary 
periodic Lipschitz function, see Mauldin and Williams [59J and the many papers 
citing this work. 

7.6 Open problems 

There are many natural questions regarding the functions which have occurred in 
this section. Some of the problems listed below may have a known answer, some 
have not yet been seriously investigated, and others may be hard. 

Problem 7.1. Characterize exactly which functions in the flexible Takagi class be- 
long to Zygmund's space A*. (A partial characterization is given in [3j Theorem 4.1].) 
Must these functions necessarily be piecewise quadratic? 

Problem 7.2. The classes A* and A* ll are both contained in A£, but they are not 
disjoint. (For instance, the alternating Takagi function T belongs to both.) Can one 
characterize the intersection A* nA^? 

Problem 7.3. Is it true that for every / G A^ 1; the level set Lf(y) := {x G [0, 1] : 
f(x) = y} is finite for almost every yl 

Problem 7.4. A Besicovitch function is one that does not possess a one-sided (fi- 
nite or infinite) derivative at any point. Does the flexible Takagi class contain any 
Besicovitch functions? 

Problem 7.5. At which points do the Takagi- van der Waerden functions f r possess 
an infinite derivative? (See [8] for the case r = 2.) 

Problem 7.6. Determine the level set structure of f r for r > 3. In particular, is the 
level set of f r at almost every level y finite? 
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Problem 7.7. Prove or disprove that T\/2, n has a unique global maximum point and 
a unique global minimum point when n > 4. 

Problem 7.8. Prove or disprove (17.101) . 



8 Applications 

In this final section, we discuss a number of applications of the Takagi function and 
its generalizations. We begin with an application to the digital sum problem in 
number theory. 

8.1 Applications in Number theory 

Let n be a positive integer with binary expansion n = a i( n )^} where ai(n) e 
{0, 1}. We define the following arithmetical sums: 



oc 



s n 



yjaj(n), (the binary digital sum), (8.1) 

i=0 

N-1 

S k (N) = s ( n ) k , ( the P° wer sum )> ( 8 - 2 ) 

n=0 
N-1 

F(£, N) = Y e ?s(n) , (the exponential sum), (8.3) 



n=0 



where k, N are positive integers, and £ is a real number. Note that s(n) is the number 
of ones in the binary expansion of n. As a special case, F(log2, N) is the number of 
odd numbers in the first N rows of Pascal's triangle. 

The power and exponential sums of digital sums were first studied in connection 
with divisibility problems involving factorials and binomial coefficients, but they oc- 
cur in other areas of mathematics as well: algebraic number theory, topology, com- 
binatorics and computational algorithms. For more information, see Stolarsky [71] . 

If N is a power of 2, it is clear that 

Sl ( N ) = Nlo ^ N } N) = N l °^ 1+e *\ 

However, it is difficult to get an explicit formula for arbitrary iVeR 

The first author to give an exact expression for Si(N) was Trollope [77J in 1968. 
Delange [25] gave a simpler proof and extended the result to digits in arbitrary bases. 
Trollope's expression is 

Si(N) = Nlo ^ N +E(N), (8.4) 
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where the error term E(N) involves the Takagi function: if N is written as N = 
2 m (l + x) with m E 7L+ and < x < 1, then 

E(N) = T 1 - 1 {2x - T(x) - (1 + x) log 2 (l + x)} . (8.5) 

We can easily derive this from ( 14.71) and (15.11) . First, note that (14.71) can be written 

T (—\ = H^-J—s(k) 

\ 2 m / 2 m 2 m— 1 
So if N is expressed as above, then N < 2 m+1 and 

S(N) = + - 2 m T f 1 + X 



as 



= 2™- 1 (m(l + x) + 2x - T(x)) , 
where the last step follows since T satisfies (15.11) . Since 

^|^ = 2- 1 (l + x)(m + log 2 (l + x)), 

([USD follows. 

It is natual to ask if we can generalize the expressions (18.41) and (I8.5P to arbitrary 
k e N. For k = 2, Coquet |23j obtained an explicit formula and proved that S2(N) 
also has a close relationship with a nowhere differentiable function. Unfortunately, 
the general formula for Sk(N) given by Coquet included a function specified only by 
a complicated recursion. 

It seemed quite difficult to find a direct formula for Sk(N) with k > 3, but finally, 
in 1995, Okada et al. [61] gave the complete answer not only for the power sum but 
also for the exponential sum. The key was to connect F(£, N) with Lebesgue's 
singular function L r (x). Set r = (1 + e^) 1 and t = log 2 N. Denote the integer part 
of t by [t], and the fractional part by {t}. Then 

H^N) = ^L r (-^), (8.6) 



for £ G R and N <E N. Using Salem's expression for L r (x) (See [61]), an explicit 
expression for F(£, N) is obtained. 

For Sk{N), observe that the following equality holds for fceN: 



1 d k 

• (8-7) 
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An essential part of this derivative is (d k /dr k )L r (x)\ r= i/2, and hence, Sk{N) can be 
expressed explicitly in terms of the nowhere differentiable function Ti/ 2 k of Section 

Later, several analogous problems were studied. For instance, instead of the 
binary expansion of natural numbers, Kobayashi [12] considered the Gray code, which 
is an encoding of natural numbers as sequences of O's and l's with the property 
that the representations of adjacent integers differ in exactly one position. Though 
the Gray code was introduced initially as a solution to a communications problem 
involving digitization of analogue data, it has since been used in a wide variety of 
other applications, including databases, experimental design and even puzzle solving; 
see Savage [65J. 

Kobayashi defined a probability measure jx r on [0, 1] analogous to the binomial 
measure, but whereby each Gray code digit (rather than binary digit) of x G [0, 1] 
is with probability r and 1 with probability 1 — r, independently of the other 
digits. Adapting the methods of [H7J and [5T], Kobayashi gave explicit expressions 
for the Gray digital sum, the Gray power sum and the Gray exponential sum, which 
are defined just as in (|8.ip -( 1Q]) . but with Gray code digits replacing binary digits. 
The expressions are in terms of the distribution function L r of \x r and a nowhere- 
differentiable continuous function T which Kobayashi calls the Gray Takagi function; 
see Section [7731 

Kriippel [16] modified the Trollope-Delange formula in a different direction, con- 
sidering instead of s(n) the alternating binary sum s(n) = X^o( — l)* a i( n )- He 
derived an expression for Si(N) = Y2n=o ^( n ) m terms of the alternating Takagi 
function T defined in (17.51) . 

A comprehensive review of the role of nowhere-differentiable functions and sin- 
gular measures in the study of digital sum problems is found in Kobayashi et al. 

031 . 



8.2 Applications in Combinatorics 

A few interesting connections between the Takagi function and the discrete isoperi- 
metric problems in combinatorics have been discovered. 

The first surprising result was given by Frankl et al. |28j in 1995. Let (J) denote 
the collection of subsets of N having k elements. For any family T C (, ) and positive 
integer / < k, let the /th shadow of T be 

A,(JF) : = {G G ( N J : 3F G 7 si. G C F}. 



43 



It is clear that |Ai(J-")|, the size of the shadow, depends on J 7 . Thus, it is natural 
to ask for a fixed m G N, which family T such that \J-\ = m attains the minimum 
size of the shadow. This problem can be viewed as a special case of the vertex 
discrete isoperimetric problem by considering a graph whose vertices correspond to 
finite subsets of ( fc ) , with an edge connecting a set of cardinality k to each of its 
subsets of cardinality k — 1. 

The shadow minimization problem was solved by Kruskal and Katona indepen- 
dently. For A,Bg (^) , define the colex order by 

A < co iex B -<=>- max{a G A\B} < max{6 G B\A}. 

Denote by Colex(k, m) the family of the first m elements in ( fc ) with the colex order. 
The Kruskal-Katona theorem says that for all T C (^) such that \T\ = m, and / < k, 

minlA^J 7 )! = \Ai(Colex(k,m))\. 

Finally, define the Kruskal-Katona function by 

Ki(m) = —m + \Ai(Colex(k,m))\. 

Frankl et al. show that, properly normalized, K^m) converges uniformly to T(x) if 
/ = k — 1. More precisely, define the shadow function Sk by normalizing Kf, where 
l = k-V. 



S k (x) := k[ 2k k KU 



x 



for < x < 1. 



2k - V 
k 

Then Sk —> T uniformly on [0, 1]. 

In general, an estimation of the Kruskal-Katona function is hard to calculate. The 
result above clearly shows the reason. Frankl et al. exploit the above relationship 
and various known properties of the Takagi function to give new estimates of the 
Kruskal-Katona function. They also define a generalized Takagi function (using the 
base 1 + c expansion of x G [0, 1] for an arbitrary real c > 0) and show that it has a 
similar connection to the minimum shadow size for / = \ ck\ . 

Another connection of the Takagi function with combinatorics was given by Guu 
[50] . Assume a graph (V, E) with a set of vertices V and a set of edges E is given. 
For S <ZV, let 0(S) denote the number of edges in E which connect a vertex in S to 
a vertex in V\S. For given < k < |V|, the edge discrete isoperimetric problem is 
to minimize 0(S) over all S having k elements. Guu considers as a special case the 
n-cube Q n = (V n ,E n ), and defines the function 

6(n, k) = mm{6(S) : S C V n , \S\ = k}. 

He shows that 

Inn ™ = r(x). 
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8.3 Applications in Real Analysis 



In this subsection we sketch two applications of the Takagi function in real analysis. 
One concerns the zero sets of continuous nowhere-differentiable functions, while the 
other relates to the study of approximate convexity of real functions. 

8.3.1 Zero sets of continuous nowhere-differentiable functions 

If a continuous function does not have a finite derivative anywhere, what can one 
say about its set of zeros? This question was answered in 1966 by Lipinski [53], who 
gave the following characterization. 

Theorem 8.1. Let C C [0, 1]. Then C is the zero set of some nonnegative continuous 
nowhere-differentiable function f : [0, 1] — > R if and only if C is closed and nowhere 
dense. 

It is easy to see that the condition on C is necessary. To prove its sufficiency, 
Lipinski used the Takagi function to construct an example of a function / with the 
required properties. To begin, write [0, 1]\C = {J^ = i( a n,b n ) with the union disjoint. 
(If [0, 1]\C is a finite union of open intervals, the construction is easy.) Define 
functions 



where < a < b < 1. Thus, the graph of T a ^ is a smaller copy of the graph of T 
confined to the interval [a, £>]. It is tempting to construct / by putting 



Indeed, Schubert [66] had mistakenly believed that this creates a nowhere differen- 

tiable function. But, as Lipinski points out, / defined this way can be differentiable 
(with f'(x) = 0) at many points of C. In fact, if A(C) > 0, then / is differentiable 
almost everywhere on C. To correct this problem, Lipinski slightly modified the 
above construction as follows. Enumerate the dyadic open subintervals of [0, 1] (i.e. 
those of the form (j/2 n , (j + l)/2 n )) in some arbitrary order as {A n : n G N}. For 
each n, let F n be any interval (aj n , bi n ) contained in A n if such an interval exists, and 
F n = otherwise, with the restriction that no interval (eij,&j) is chosen more than 
once. Now define a function / by 




a < x < b, 




0, x G C 

(x), x G (a n , b n ), n G N. 





x G C 

x G (a;A), (ai,bi) G" {F n } 
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Lipinski shows that / is continuous and nowhere differentiable, and hence has the 
desired properties. Of course, in this construction we can replace T with any contin- 
uous nowhere-differentiable function which is strictly positive in (0, 1) and vanishes 
at and 1. 

8.3.2 Approximate convexity 

Our next application is to approximate convexity of real functions. Let V be a convex 
subset of a normed space and let e > 0, p > be given constants. Say a function 
/ : V — >■ R is (e,p)-midconvexii 

Jx + y\ ^ f(x) + f(y) , cT/ 

It was shown by Hazy and Pales [32J that if / : V — > E is continuous and (e, 1)- 
midconvex, then for all x, y G V and t G [0, 1], 

/(to + (1 - t)y) < tf(x) + (1 - *)/(y) + eu(t)\\x - y||, (8.8) 

where u = 2T. A natural question is whether u in this last inequality can be replaced 
by a smaller function. The negative answer came a few years later, when Boros [T8] 
proved that u is itself (1, l)-convex. In terms of T, this comes down to the inequality 

x-y 
2 

(To see that this implies the minimality of u in (18. 81) . take / = eu, x — 1 and y = 0.) 
Boros' proof was somewhat laborious, with no fewer than eight separate cases in the 
induction step. But the important thing was that it confirmed Pales' conjecture of 
the minimality of u. 

The next question, then, is what function takes the role of uj when 1. More 
precisely, what is the smallest function uj p such that, whenever / : V — > K is {e,p)~ 
midconvex, we have 

f(tx + (1 - t)y) < tf{x) + (1 - t)f(y) + su p (t) \\x - y\\ p (8.9) 

for all x, y G V and t G [0, 1]? For p G [1,2], Tabor and Tabor [75], [7J] showed that 
the answer is the function 

OO j 

^) = 2E^( 2Bi ). ( 8 - 10 ) 

n=0 



T(x) + r(y) | 
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which belongs to the Takagi class. In [73] they show that any (e,p)-midconvex 
function satisfies (I8.9p . and in [71] they establish that u p is itself (l,p)-midconvex; 
that is, 

u J*_±y)< ^+»M + \ x -«\*. (s.ii) 



Note that U2(x) = 4x(l — x) for x G [0, 1], and oj\(x) = 2T(x). Tabor and Tabor 
prove (18.111) first for p = 2, and then deduce it for all p G [1,2] by expressing uj v as 
an infinite series in terms of u%. A different proof of their result (which includes that 
of Boros) is given by Allaart [4] , who derives the formula 

T7J\ m - ln - 1 C_1^i(fe) 

fc=0 i=0 

where £i{k) G {0, 1} is determined by YH=o 2 l£ i{k) = k- Using this expression, (18. lip 
can be reduced to a simple inequality for weighted sums of binary digits, which has 
an easy induction proof; see [1]. 

Shortly after Tabor and Tabor's work appeared, Mako and Pales [56] proved 
a much more general result, which has the following remarkable consequence: For 
p G (0, 1], the minimal function u p in (18. 9ft is given by 

oo 1 

u p (x) = 2Y, ¥ m n x)Y- (8-13) 

n=0 

Comparing ( 18. 1Q|) and (18 . 13 j) . it is fascinating to see how at the boundary case p — 1, 
the exponent p "jumps" to the other factor in the series' summand. 

For more references to papers concerning approximate convexity, we refer to 



8.4 Connection with the Riemann Hypothesis 

To end this section, we briefly mention a recent result connecting the Takagi function 
with the Riemann Hypothesis. For a more detailed account, we refer to Lagarias [49] . 

Let F n be the nth Farey sequence (also called Farey series), that is, the set 
of irreducible fractions in (0, 1] with denominator less than or equal to n. Then 
\F n \ = $(n), where $(n) = Xl&KnV 9 ^)? f being the Euler totient function. The 
connection between Farey series and the Riemann Hypothesis is well documented and 
goes back to a 1924 paper by Franel [27] . Recently, Balasubramanian, Kanemitsu 
and Yoshimoto [14] showed, as an interesting example of their more general theory, 
that the Riemann Hypothesis is equivalent to the statement 

T(r) = -$(n) + 0(ni +£ ) for every e > 0. (8.14) 

r&F n 
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Note that the left hand side involves only function values of rational numbers, and 
can hence be computed by the method of Section 15.11 While it may seem unlikely 
that the Riemann Hypothesis will some day be solved by directly proving ( 18.141) . the 
connection is nonetheless surprisingly elegant and beautiful. 
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